Overview

Brought to you by YData

Dataset statistics

Number of variables: 23
Number of observations: 3677
Missing cells: 6710
Missing cells (%): 7.9%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.2 MiB
Average record size in memory: 632.1 B
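
The missing-cell percentage can be checked from the counts in this block. The sketch below assumes the percentage is taken over all cells (variables times observations); the variable names are illustrative and not part of the report.

```python
# Figures copied from the "Dataset statistics" block of this report.
n_variables = 23
n_observations = 3677
missing_cells = 6710

# Assumption: the percentage is computed over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
# prints: 84571 cells, 7.9% missing
```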

Variable types

Categorical: 10
Text: 3
Numeric: 10

Alerts

area is highly overall correlated with bathroom and 5 other fields (High correlation)
bathroom is highly overall correlated with area and 5 other fields (High correlation)
bedRoom is highly overall correlated with area and 5 other fields (High correlation)
built_up_area is highly overall correlated with area and 4 other fields (High correlation)
carpet_area is highly overall correlated with area and 5 other fields (High correlation)
facing is highly overall correlated with built_up_area (High correlation)
price is highly overall correlated with area and 7 other fields (High correlation)
price_per_sqft is highly overall correlated with price (High correlation)
property_type is highly overall correlated with bedRoom and 2 other fields (High correlation)
servant room is highly overall correlated with bathroom and 1 other field (High correlation)
super_built_up_area is highly overall correlated with area and 7 other fields (High correlation)
store room is highly imbalanced (55.7%) (Imbalance)
facing has 1045 (28.4%) missing values (Missing)
super_built_up_area has 1802 (49.0%) missing values (Missing)
built_up_area has 1987 (54.0%) missing values (Missing)
carpet_area has 1805 (49.1%) missing values (Missing)
area is highly skewed (γ1 = 29.73095613) (Skewed)
built_up_area is highly skewed (γ1 = 40.70657243) (Skewed)
carpet_area is highly skewed (γ1 = 24.33323909) (Skewed)
floorNum has 129 (3.5%) zeros (Zeros)
luxury_score has 462 (12.6%) zeros (Zeros)
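
The 55.7% imbalance figure for store room is consistent with one minus the normalized Shannon entropy of the class counts. The sketch below is an illustration under that assumption, not a statement of how the profiling tool computes the score; `imbalance_score` is a hypothetical helper, and the counts (3339 and 338) are the store room value counts listed later in this report.

```python
import math

def imbalance_score(counts):
    """Return 1 - (Shannon entropy / maximum entropy) for a list of class counts.

    0.0 means perfectly balanced classes; values near 1.0 mean one class dominates.
    Hypothetical helper written for this example.
    """
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0  # convention assumed here: a single class is not scored
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(n_classes)

# store room: 3339 rows with value 0, 338 rows with value 1
score = imbalance_score([3339, 338])
print(f"{score:.1%}")
```

A perfectly balanced two-class column would score 0.0 under this definition, so a value above one half indicates a strongly dominant class.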

Reproduction

Analysis started: 2024-09-05 05:57:24.837173
Analysis finished: 2024-09-05 05:58:01.259139
Duration: 36.42 seconds
Software version: ydata-profiling v4.9.0
Download configuration: config.json
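
As a quick consistency check, the reported duration can be recomputed from the two timestamps. This is a minimal sketch using only the standard library; the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" block.
started = datetime.fromisoformat("2024-09-05 05:57:24.837173")
finished = datetime.fromisoformat("2024-09-05 05:58:01.259139")

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
# prints: 36.42 seconds
```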

Variables

property_type
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 248.6 KiB

Length

Max length: 5
Median length: 4
Mean length: 4.2336144
Min length: 4

Characters and Unicode

Total characters: 15567
Distinct characters: 9
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: flat
2nd row: flat
3rd row: flat
4th row: house
5th row: flat

Common Values

Value   Count   Frequency (%)
flat    2818    76.6%
house   859     23.4%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value   Count   Frequency (%)
flat    2818    76.6%
house   859     23.4%

Most occurring characters

Value   Count   Frequency (%)
f       2818    18.1%
l       2818    18.1%
a       2818    18.1%
t       2818    18.1%
h       859     5.5%
o       859     5.5%
u       859     5.5%
s       859     5.5%
e       859     5.5%

Distinct: 676
Distinct (%): 18.4%
Missing: 1
Missing (%): < 0.1%
Memory size: 293.9 KiB

Length

Max length: 49
Median length: 39
Mean length: 16.869695
Min length: 1

Characters and Unicode

Total characters: 62013
Distinct characters: 41
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 308
Unique (%): 8.4%

Sample

1st row: umang monsoon breeze
2nd row: ireo skyon
3rd row: dlf regal gardens
4th row: independent
5th row: dlf the arbour

Value                Count   Frequency (%)
independent          491     5.1%
the                  350     3.6%
dlf                  220     2.3%
park                 209     2.2%
city                 166     1.7%
emaar                155     1.6%
global               153     1.6%
m3m                  152     1.6%
signature            150     1.6%
heights              134     1.4%
Other values (783)   7497    77.5%

Most occurring characters

Value                Count   Frequency (%)
e                    6710    10.8%
(space)              6003    9.7%
a                    5861    9.5%
r                    4171    6.7%
n                    4163    6.7%
i                    3830    6.2%
t                    3719    6.0%
s                    3472    5.6%
l                    2943    4.7%
o                    2755    4.4%
Other values (31)    18386   29.6%


sector
Text

Distinct: 113
Distinct (%): 3.1%
Missing: 0
Missing (%): 0.0%
Memory size: 266.9 KiB

Length

Max length: 26
Median length: 9
Mean length: 9.3209138
Min length: 7

Characters and Unicode

Total characters: 34273
Distinct characters: 31
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: sector 78
2nd row: sector 60
3rd row: sector 90
4th row: sector 2
5th row: sector 63

Value                Count   Frequency (%)
sector               3452    46.8%
road                 178     2.4%
sohna                166     2.2%
85                   108     1.5%
102                  107     1.4%
92                   100     1.4%
69                   93      1.3%
90                   89      1.2%
65                   87      1.2%
81                   87      1.2%
Other values (106)   2915    39.5%

Most occurring characters

Value                Count   Frequency (%)
o                    3807    11.1%
(space)              3705    10.8%
s                    3697    10.8%
r                    3697    10.8%
e                    3542    10.3%
c                    3503    10.2%
t                    3463    10.1%
1                    1076    3.1%
0                    804     2.3%
8                    780     2.3%
Other values (21)    6199    18.1%


price
Real number (ℝ)

HIGH CORRELATION 

Distinct: 473
Distinct (%): 12.9%
Missing: 17
Missing (%): 0.5%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.5336639
Minimum: 0.07
Maximum: 31.5
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 57.5 KiB

Quantile statistics

Minimum: 0.07
5-th percentile: 0.37
Q1: 0.95
median: 1.52
Q3: 2.75
95-th percentile: 8.5
Maximum: 31.5
Range: 31.43
Interquartile range (IQR): 1.8

Descriptive statistics

Standard deviation: 2.9806235
Coefficient of variation (CV): 1.1764084
Kurtosis: 14.933373
Mean: 2.5336639
Median Absolute Deviation (MAD): 0.72
Skewness: 3.2791705
Sum: 9273.21
Variance: 8.8841164
Monotonicity: Not monotonic
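
Several of the price statistics above follow from one another. The sketch below recomputes them from the reported figures, assuming the mean is taken over non-missing rows only; the variable names are illustrative.

```python
# Figures copied from the price block of this report.
n_observations = 3677
n_missing = 17
total = 9273.21          # Sum
std = 2.9806235          # Standard deviation
q1, q3 = 0.95, 2.75
minimum, maximum = 0.07, 31.5

n_valid = n_observations - n_missing   # rows that actually have a price
mean = total / n_valid                 # assumption: mean ignores missing rows
cv = std / mean                        # coefficient of variation
variance = std ** 2
iqr = q3 - q1
value_range = maximum - minimum

print(f"mean={mean:.4f} cv={cv:.4f} variance={variance:.4f} "
      f"iqr={iqr:.2f} range={value_range:.2f}")
```

A CV above 1 means the standard deviation exceeds the mean, which fits the long right tail suggested by the gap between the median (1.52) and the maximum (31.5).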
[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value                Count   Frequency (%)
1.25                 80      2.2%
1.2                  64      1.7%
1.5                  64      1.7%
0.9                  63      1.7%
1.1                  62      1.7%
1.4                  60      1.6%
1.3                  57      1.6%
2                    52      1.4%
0.95                 52      1.4%
1.6                  48      1.3%
Other values (463)   3058    83.2%

Minimum 10 values

Value   Count   Frequency (%)
0.07    1       < 0.1%
0.16    1       < 0.1%
0.17    1       < 0.1%
0.19    1       < 0.1%
0.2     8       0.2%
0.21    6       0.2%
0.22    8       0.2%
0.23    1       < 0.1%
0.24    6       0.2%
0.25    11      0.3%

Maximum 10 values

Value   Count   Frequency (%)
31.5    1       < 0.1%
27.5    1       < 0.1%
26      2       0.1%
25      1       < 0.1%
24      1       < 0.1%
23      1       < 0.1%
22      1       < 0.1%
20      3       0.1%
19.5    2       0.1%
19      3       0.1%

price_per_sqft
Real number (ℝ)

HIGH CORRELATION 

Distinct: 2651
Distinct (%): 72.4%
Missing: 17
Missing (%): 0.5%
Infinite: 0
Infinite (%): 0.0%
Mean: 13892.668
Minimum: 4
Maximum: 600000
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 57.5 KiB

Quantile statistics

Minimum: 4
5-th percentile: 4715.95
Q1: 6817.25
median: 9020
Q3: 13880.5
95-th percentile: 33333
Maximum: 600000
Range: 599996
Interquartile range (IQR): 7063.25

Descriptive statistics

Standard deviation: 23210.067
Coefficient of variation (CV): 1.6706702
Kurtosis: 186.92801
Mean: 13892.668
Median Absolute Deviation (MAD): 2794
Skewness: 11.43719
Sum: 50847166
Variance: 5.3870722 × 10^8
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value                 Count   Frequency (%)
10000                 27      0.7%
8000                  19      0.5%
5000                  17      0.5%
12500                 14      0.4%
6666                  13      0.4%
22222                 13      0.4%
11111                 13      0.4%
7500                  12      0.3%
8333                  12      0.3%
33333                 11      0.3%
Other values (2641)   3509    95.4%
(Missing)             17      0.5%

Minimum 10 values

Value   Count   Frequency (%)
4       1       < 0.1%
5       1       < 0.1%
7       1       < 0.1%
9       1       < 0.1%
53      1       < 0.1%
57      1       < 0.1%
58      2       0.1%
60      1       < 0.1%
61      1       < 0.1%
79      1       < 0.1%

Maximum 10 values

Value    Count   Frequency (%)
600000   1       < 0.1%
400000   1       < 0.1%
315789   1       < 0.1%
308333   1       < 0.1%
290948   1       < 0.1%
283333   1       < 0.1%
266666   1       < 0.1%
261194   1       < 0.1%
245398   1       < 0.1%
241666   1       < 0.1%

area
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct: 1312
Distinct (%): 35.8%
Missing: 17
Missing (%): 0.5%
Infinite: 0
Infinite (%): 0.0%
Mean: 2888.3311
Minimum: 50
Maximum: 875000
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 57.5 KiB

Quantile statistics

Minimum: 50
5-th percentile: 518.85
Q1: 1232.25
median: 1733
Q3: 2300
95-th percentile: 4246.2
Maximum: 875000
Range: 874950
Interquartile range (IQR): 1067.75

Descriptive statistics

Standard deviation: 23167.506
Coefficient of variation (CV): 8.0210699
Kurtosis: 942.02903
Mean: 2888.3311
Median Absolute Deviation (MAD): 533
Skewness: 29.730956
Sum: 10571292
Variance: 5.3673333 × 10^8
Monotonicity: Not monotonic
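
The extreme skewness of area points to a handful of very large values. One common way to flag them, which the report itself does not apply, is Tukey's rule: values beyond 1.5 × IQR from the quartiles are treated as candidate outliers. The sketch below uses the quartiles reported above; `tukey_fences` is a hypothetical helper and the list holds the ten largest distinct area values shown in this report.

```python
def tukey_fences(q1, q3, k=1.5):
    """Return (lower, upper) outlier fences from the quartiles (Tukey's rule)."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr

# Quartiles copied from the area block: Q1 = 1232.25, Q3 = 2300.
lower, upper = tukey_fences(1232.25, 2300.0)

# Ten largest distinct area values listed in the report.
largest_reported = [875000, 642857, 620000, 566667, 215517,
                    98978, 82781, 65517, 65261, 58228]
flagged = [v for v in largest_reported if v > upper]

print(lower, upper, len(flagged))
# prints: -369.375 3901.625 10
```

Because the reported 95th percentile (4246.2) already lies above the upper fence, this rule would flag at least roughly 5% of the non-missing rows, so a domain-specific threshold may be more appropriate than the default k = 1.5.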
[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value                 Count   Frequency (%)
1650                  54      1.5%
1350                  48      1.3%
1800                  47      1.3%
3240                  43      1.2%
1950                  43      1.2%
2700                  39      1.1%
900                   38      1.0%
2000                  33      0.9%
2250                  25      0.7%
2400                  23      0.6%
Other values (1302)   3267    88.8%

Minimum 10 values

Value   Count   Frequency (%)
50      4       0.1%
55      1       < 0.1%
56      1       < 0.1%
57      1       < 0.1%
60      2       0.1%
61      1       < 0.1%
67      2       0.1%
70      1       < 0.1%
72      1       < 0.1%
76      1       < 0.1%

Maximum 10 values

Value    Count   Frequency (%)
875000   1       < 0.1%
642857   1       < 0.1%
620000   1       < 0.1%
566667   1       < 0.1%
215517   1       < 0.1%
98978    1       < 0.1%
82781    1       < 0.1%
65517    2       0.1%
65261    1       < 0.1%
58228    1       < 0.1%

Distinct: 2355
Distinct (%): 64.0%
Missing: 0
Missing (%): 0.0%
Memory size: 428.2 KiB

Length

Max length: 124
Median length: 119
Mean length: 54.236062
Min length: 12

Characters and Unicode

Total characters: 199426
Distinct characters: 35
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1849
Unique (%): 50.3%

Sample

1st row: Built Up area: 1239 (115.11 sq.m.)Carpet area: 1100 sq.ft. (102.19 sq.m.)
2nd row: Super Built up area 1524(141.58 sq.m.)Built Up area: 1250 sq.ft. (116.13 sq.m.)Carpet area: 921 sq.ft. (85.56 sq.m.)
3rd row: Super Built up area 1744(162.02 sq.m.)
4th row: Carpet area: 3250 (301.93 sq.m.)
5th row: Built Up area: 3956 (367.52 sq.m.)Carpet area: 2200 sq.ft. (204.39 sq.m.)
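
The sample rows pack up to three area measurements into one string. A regular expression is one way to split them into separate numeric fields. The sketch below is written against the five sample rows only; `parse_area_text` and `AREA_PATTERN` are hypothetical names, the "plot area" label is an assumption based on the word counts for this variable, and the first number after each label is assumed to be in square feet even where the unit is omitted.

```python
import re

# Labels are matched case-insensitively; "super built up area" must come
# first so it is not partially matched as "built up area".
AREA_PATTERN = re.compile(
    r"(super built up area|built up area|carpet area|plot area)"
    r"\s*:?\s*(\d+(?:\.\d+)?)",
    re.IGNORECASE,
)

def parse_area_text(text):
    """Map each area label found in `text` to the first number that follows it."""
    areas = {}
    for label, number in AREA_PATTERN.findall(text):
        key = label.lower().replace(" ", "_")
        areas[key] = float(number)
    return areas

row = ("Super Built up area 1524(141.58 sq.m.)"
       "Built Up area: 1250 sq.ft. (116.13 sq.m.)"
       "Carpet area: 921 sq.ft. (85.56 sq.m.)")
parsed = parse_area_text(row)
print(parsed)
# prints: {'super_built_up_area': 1524.0, 'built_up_area': 1250.0, 'carpet_area': 921.0}
```

Rows that mention only one measurement, such as the 4th sample row, yield a dictionary with a single key, which matches the high share of missing values reported for the individual area columns.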

Value                 Count   Frequency (%)
area                  5573    18.5%
sq.m                  3655    12.1%
up                    3020    10.0%
built                 2316    7.7%
super                 1875    6.2%
sq.ft                 1751    5.8%
sq.m.)carpet          1185    3.9%
sq.m.)built           702     2.3%
carpet                683     2.3%
plot                  681     2.3%
Other values (2846)   8700    28.9%

Most occurring characters

Value                 Count   Frequency (%)
(space)               26464   13.3%
.                     20389   10.2%
a                     13154   6.6%
r                     9456    4.7%
e                     9320    4.7%
1                     9205    4.6%
s                     7567    3.8%
q                     7431    3.7%
t                     7324    3.7%
u                     6770    3.4%
Other values (25)     82346   41.3%


bedRoom
Real number (ℝ)

HIGH CORRELATION 

Distinct: 19
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.3600761
Minimum: 1
Maximum: 21
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 57.5 KiB

Quantile statistics

Minimum: 1
5-th percentile: 2
Q1: 2
median: 3
Q3: 4
95-th percentile: 6
Maximum: 21
Range: 20
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 1.8976289
Coefficient of variation (CV): 0.56475771
Kurtosis: 18.212873
Mean: 3.3600761
Median Absolute Deviation (MAD): 1
Skewness: 3.4851418
Sum: 12355
Variance: 3.6009954
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=19)]

Common values

Value              Count   Frequency (%)
3                  1496    40.7%
2                  942     25.6%
4                  660     17.9%
5                  210     5.7%
1                  124     3.4%
6                  74      2.0%
9                  41      1.1%
8                  30      0.8%
12                 28      0.8%
7                  28      0.8%
Other values (9)   44      1.2%

Minimum 10 values

Value   Count   Frequency (%)
1       124     3.4%
2       942     25.6%
3       1496    40.7%
4       660     17.9%
5       210     5.7%
6       74      2.0%
7       28      0.8%
8       30      0.8%
9       41      1.1%
10      20      0.5%

Maximum 10 values

Value   Count   Frequency (%)
21      1       < 0.1%
20      1       < 0.1%
19      2       0.1%
18      2       0.1%
16      12      0.3%
14      1       < 0.1%
13      4       0.1%
12      28      0.8%
11      1       < 0.1%
10      20      0.5%

bathroom
Real number (ℝ)

HIGH CORRELATION 

Distinct: 19
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.4245309
Minimum: 1
Maximum: 21
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 57.5 KiB

Quantile statistics

Minimum: 1
5-th percentile: 2
Q1: 2
median: 3
Q3: 4
95-th percentile: 6
Maximum: 21
Range: 20
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 1.9480681
Coefficient of variation (CV): 0.56885693
Kurtosis: 17.542297
Mean: 3.4245309
Median Absolute Deviation (MAD): 1
Skewness: 3.2488298
Sum: 12592
Variance: 3.7949693
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=19)]

Common values

Value              Count   Frequency (%)
3                  1077    29.3%
2                  1047    28.5%
4                  820     22.3%
5                  294     8.0%
1                  156     4.2%
6                  117     3.2%
9                  41      1.1%
7                  40      1.1%
8                  25      0.7%
12                 22      0.6%
Other values (9)   38      1.0%

Minimum 10 values

Value   Count   Frequency (%)
1       156     4.2%
2       1047    28.5%
3       1077    29.3%
4       820     22.3%
5       294     8.0%
6       117     3.2%
7       40      1.1%
8       25      0.7%
9       41      1.1%
10      9       0.2%

Maximum 10 values

Value   Count   Frequency (%)
21      1       < 0.1%
20      3       0.1%
18      4       0.1%
17      3       0.1%
16      8       0.2%
14      2       0.1%
13      4       0.1%
12      22      0.6%
11      4       0.1%
10      9       0.2%

balcony
Categorical

Distinct: 5
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 238.1 KiB

Length

Max length: 2
Median length: 1
Mean length: 1.3187381
Min length: 1

Characters and Unicode

Total characters: 4849
Distinct characters: 5
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 2
3rd row: 2
4th row: 2
5th row: 3

Common Values

Value   Count   Frequency (%)
3+      1172    31.9%
3       1074    29.2%
2       884     24.0%
1       365     9.9%
0       182     4.9%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value   Count   Frequency (%)
3       2246    61.1%
2       884     24.0%
1       365     9.9%
0       182     4.9%

Most occurring characters

Value   Count   Frequency (%)
3       2246    46.3%
+       1172    24.2%
2       884     18.2%
1       365     7.5%
0       182     3.8%


floorNum
Real number (ℝ)

ZEROS 

Distinct: 43
Distinct (%): 1.2%
Missing: 19
Missing (%): 0.5%
Infinite: 0
Infinite (%): 0.0%
Mean: 6.7982504
Minimum: 0
Maximum: 51
Zeros: 129
Zeros (%): 3.5%
Negative: 0
Negative (%): 0.0%
Memory size: 57.5 KiB

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 2
median: 5
Q3: 10
95-th percentile: 18
Maximum: 51
Range: 51
Interquartile range (IQR): 8

Descriptive statistics

Standard deviation: 6.0124542
Coefficient of variation (CV): 0.884412
Kurtosis: 4.5153928
Mean: 6.7982504
Median Absolute Deviation (MAD): 3
Skewness: 1.6936988
Sum: 24868
Variance: 36.149606
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=43)]

Common values

Value               Count   Frequency (%)
3                   498     13.5%
2                   493     13.4%
1                   351     9.5%
4                   316     8.6%
8                   195     5.3%
6                   183     5.0%
10                  179     4.9%
7                   176     4.8%
5                   169     4.6%
9                   161     4.4%
Other values (33)   937     25.5%

Minimum 10 values

Value   Count   Frequency (%)
0       129     3.5%
1       351     9.5%
2       493     13.4%
3       498     13.5%
4       316     8.6%
5       169     4.6%
6       183     5.0%
7       176     4.8%
8       195     5.3%
9       161     4.4%

Maximum 10 values

Value   Count   Frequency (%)
51      1       < 0.1%
45      1       < 0.1%
44      1       < 0.1%
43      2       0.1%
40      1       < 0.1%
39      2       0.1%
38      1       < 0.1%
35      2       0.1%
34      2       0.1%
33      4       0.1%

facing
Categorical

HIGH CORRELATION  MISSING 

Distinct: 8
Distinct (%): 0.3%
Missing: 1045
Missing (%): 28.4%
Memory size: 250.0 KiB

Length

Max length: 10
Median length: 5
Mean length: 6.8381459
Min length: 4

Characters and Unicode

Total characters: 17998
Distinct characters: 13
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: East
2nd row: East
3rd row: North
4th row: South
5th row: West

Common Values

Value        Count   Frequency (%)
East         623     16.9%
North-East   623     16.9%
North        387     10.5%
West         249     6.8%
South        231     6.3%
North-West   193     5.2%
South-East   173     4.7%
South-West   153     4.2%
(Missing)    1045    28.4%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value        Count   Frequency (%)
east         623     23.7%
north-east   623     23.7%
north        387     14.7%
west         249     9.5%
south        231     8.8%
north-west   193     7.3%
south-east   173     6.6%
south-west   153     5.8%
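
The two facing tables report different percentages for the same counts. The numbers are consistent with the first table dividing by all 3677 rows (missing included) and the second dividing by the 2632 non-missing rows. The sketch below checks that reading for the East category; it is an interpretation of the figures, not a documented behaviour of the tool, and the variable names are illustrative.

```python
# Figures copied from the facing block of this report.
n_observations = 3677
n_missing = 1045
east = 623

n_valid = n_observations - n_missing        # rows with a facing value
share_of_all = 100 * east / n_observations  # denominator includes missing rows
share_of_valid = 100 * east / n_valid       # denominator excludes missing rows

print(f"{n_valid} valid rows: {share_of_all:.1f}% of all, "
      f"{share_of_valid:.1f}% of valid")
# prints: 2632 valid rows: 16.9% of all, 23.7% of valid
```

When comparing category shares across variables, it helps to state which denominator is in use, since a column with 28.4% missing values shifts every share noticeably.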

Most occurring characters

Value              Count   Frequency (%)
t                  3774    21.0%
s                  2014    11.2%
o                  1760    9.8%
h                  1760    9.8%
E                  1419    7.9%
a                  1419    7.9%
N                  1203    6.7%
r                  1203    6.7%
-                  1142    6.3%
W                  595     3.3%
Other values (3)   1709    9.5%


agePossession
Categorical

Distinct: 6
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 281.5 KiB

Length

Max length: 18
Median length: 14
Mean length: 13.385912
Min length: 9

Characters and Unicode

Total characters: 49220
Distinct characters: 25
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Undefined
2nd row: Relatively New
3rd row: Relatively New
4th row: Undefined
5th row: Undefined

Common Values

Value                Count   Frequency (%)
Relatively New       1646    44.8%
New Property         593     16.1%
Moderately Old       563     15.3%
Undefined            306     8.3%
Old Property         303     8.2%
Under Construction   266     7.2%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value          Count   Frequency (%)
new            2239    31.8%
relatively     1646    23.4%
property       896     12.7%
old            866     12.3%
moderately     563     8.0%
undefined      306     4.3%
under          266     3.8%
construction   266     3.8%

Most occurring characters

Value               Count   Frequency (%)
e                   8431    17.1%
l                   4721    9.6%
t                   3637    7.4%
(space)             3371    6.8%
y                   3105    6.3%
r                   2887    5.9%
d                   2307    4.7%
N                   2239    4.5%
w                   2239    4.5%
i                   2218    4.5%
Other values (15)   14065   28.6%


super_built_up_area
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct: 593
Distinct (%): 31.6%
Missing: 1802
Missing (%): 49.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1925.2376
Minimum: 89
Maximum: 10000
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 57.5 KiB

Quantile statistics

Minimum: 89
5-th percentile: 767
Q1: 1479.5
median: 1828
Q3: 2215
95-th percentile: 3185
Maximum: 10000
Range: 9911
Interquartile range (IQR): 735.5

Descriptive statistics

Standard deviation: 764.17218
Coefficient of variation (CV): 0.39692356
Kurtosis: 10.349191
Mean: 1925.2376
Median Absolute Deviation (MAD): 372
Skewness: 1.8364563
Sum: 3609820.5
Variance: 583959.12
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value                Count   Frequency (%)
1950                 37      1.0%
1650                 37      1.0%
2000                 25      0.7%
1578                 25      0.7%
1640                 22      0.6%
2150                 22      0.6%
2408                 19      0.5%
1900                 19      0.5%
1930                 18      0.5%
1350                 17      0.5%
Other values (583)   1634    44.4%
(Missing)            1802    49.0%

Minimum 10 values

Value   Count   Frequency (%)
89      1       < 0.1%
145     1       < 0.1%
161     1       < 0.1%
215     1       < 0.1%
216     1       < 0.1%
325     1       < 0.1%
340     1       < 0.1%
352     1       < 0.1%
380     1       < 0.1%
406     1       < 0.1%

Maximum 10 values

Value   Count   Frequency (%)
10000   1       < 0.1%
6926    1       < 0.1%
6000    1       < 0.1%
5800    2       0.1%
5514    1       < 0.1%
5350    2       0.1%
5200    2       0.1%
4890    1       < 0.1%
4857    1       < 0.1%
4848    2       0.1%

built_up_area
Real number (ℝ)

HIGH CORRELATION  MISSING  SKEWED 

Distinct: 644
Distinct (%): 38.1%
Missing: 1987
Missing (%): 54.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2379.5858
Minimum: 2
Maximum: 737147
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 57.5 KiB

Quantile statistics

Minimum: 2
5-th percentile: 240.45
Q1: 1100
median: 1650
Q3: 2400
95-th percentile: 4691
Maximum: 737147
Range: 737145
Interquartile range (IQR): 1300

Descriptive statistics

Standard deviation: 17942.88
Coefficient of variation (CV): 7.5403375
Kurtosis: 1667.8704
Mean: 2379.5858
Median Absolute Deviation (MAD): 650
Skewness: 40.706572
Sum: 4021500
Variance: 3.2194695 × 10^8
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value                Count   Frequency (%)
1800                 41      1.1%
3240                 37      1.0%
1900                 34      0.9%
1350                 33      0.9%
2700                 33      0.9%
900                  28      0.8%
1600                 26      0.7%
1300                 24      0.7%
2000                 24      0.7%
1700                 23      0.6%
Other values (634)   1387    37.7%
(Missing)            1987    54.0%

Minimum 10 values

Value   Count   Frequency (%)
2       1       < 0.1%
14      1       < 0.1%
30      1       < 0.1%
33      1       < 0.1%
50      3       0.1%
53      1       < 0.1%
55      1       < 0.1%
56      1       < 0.1%
57      1       < 0.1%
60      5       0.1%

Maximum 10 values

Value    Count   Frequency (%)
737147   1       < 0.1%
13500    1       < 0.1%
11286    1       < 0.1%
9500     1       < 0.1%
9000     7       0.2%
8775     1       < 0.1%
8286     1       < 0.1%
8067.8   1       < 0.1%
8000     1       < 0.1%
7500     2       0.1%

carpet_area
Real number (ℝ)

HIGH CORRELATION  MISSING  SKEWED 

Distinct: 733
Distinct (%): 39.2%
Missing: 1805
Missing (%): 49.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 2529.1795
Minimum: 15
Maximum: 607936
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 57.5 KiB

Quantile statistics

Minimum: 15
5-th percentile: 350
Q1: 843
median: 1300
Q3: 1790
95-th percentile: 2950
Maximum: 607936
Range: 607921
Interquartile range (IQR): 947

Descriptive statistics

Standard deviation: 22799.836
Coefficient of variation (CV): 9.0147166
Kurtosis: 604.53764
Mean: 2529.1795
Median Absolute Deviation (MAD): 472.5
Skewness: 24.333239
Sum: 4734624
Variance: 5.1983254 × 10^8
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value                Count   Frequency (%)
1400                 42      1.1%
1800                 35      1.0%
1600                 35      1.0%
1200                 31      0.8%
1500                 29      0.8%
1650                 28      0.8%
1350                 27      0.7%
1300                 23      0.6%
1000                 22      0.6%
1450                 22      0.6%
Other values (723)   1578    42.9%
(Missing)            1805    49.1%

Minimum 10 values

Value   Count   Frequency (%)
15      1       < 0.1%
33      1       < 0.1%
48      1       < 0.1%
50      1       < 0.1%
59      1       < 0.1%
60      1       < 0.1%
66      1       < 0.1%
72      1       < 0.1%
76.44   3       0.1%
77.31   1       < 0.1%

Maximum 10 values

Value    Count   Frequency (%)
607936   1       < 0.1%
569243   1       < 0.1%
514396   1       < 0.1%
64529    1       < 0.1%
64412    1       < 0.1%
58141    1       < 0.1%
54917    1       < 0.1%
48811    1       < 0.1%
45966    1       < 0.1%
34401    1       < 0.1%

study room
Categorical

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 237.0 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 3677
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value   Count   Frequency (%)
0       2972    80.8%
1       705     19.2%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value   Count   Frequency (%)
0       2972    80.8%
1       705     19.2%

Most occurring characters

Value   Count   Frequency (%)
0       2972    80.8%
1       705     19.2%


servant room
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 237.0 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 3677
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 1
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value   Count   Frequency (%)
0       2349    63.9%
1       1328    36.1%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value   Count   Frequency (%)
0       2349    63.9%
1       1328    36.1%

Most occurring characters

Value   Count   Frequency (%)
0       2349    63.9%
1       1328    36.1%


store room
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size237.0 KiB
0
3339 
1
338 

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 3677
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
0 | 3339 | 90.8%
1 | 338 | 9.2%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
0 | 3339 | 90.8%
1 | 338 | 9.2%

Most occurring characters

Value | Count | Frequency (%)
0 | 3339 | 90.8%
1 | 338 | 9.2%

Most occurring categories

Value | Count | Frequency (%)
(unknown) | 3677 | 100.0%

Most frequent character per category

(unknown)
Value | Count | Frequency (%)
0 | 3339 | 90.8%
1 | 338 | 9.2%

Most occurring scripts

Value | Count | Frequency (%)
(unknown) | 3677 | 100.0%

Most frequent character per script

(unknown)
Value | Count | Frequency (%)
0 | 3339 | 90.8%
1 | 338 | 9.2%

Most occurring blocks

Value | Count | Frequency (%)
(unknown) | 3677 | 100.0%

Most frequent character per block

(unknown)
Value | Count | Frequency (%)
0 | 3339 | 90.8%
1 | 338 | 9.2%

pooja room
Categorical

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 237.0 KiB

Value | Count
0 | 3021
1 | 656

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 3677
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
0 | 3021 | 82.2%
1 | 656 | 17.8%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
0 | 3021 | 82.2%
1 | 656 | 17.8%

Most occurring characters

Value | Count | Frequency (%)
0 | 3021 | 82.2%
1 | 656 | 17.8%

Most occurring categories

Value | Count | Frequency (%)
(unknown) | 3677 | 100.0%

Most frequent character per category

(unknown)
Value | Count | Frequency (%)
0 | 3021 | 82.2%
1 | 656 | 17.8%

Most occurring scripts

Value | Count | Frequency (%)
(unknown) | 3677 | 100.0%

Most frequent character per script

(unknown)
Value | Count | Frequency (%)
0 | 3021 | 82.2%
1 | 656 | 17.8%

Most occurring blocks

Value | Count | Frequency (%)
(unknown) | 3677 | 100.0%

Most frequent character per block

(unknown)
Value | Count | Frequency (%)
0 | 3021 | 82.2%
1 | 656 | 17.8%

others
Categorical

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 237.0 KiB

Value | Count
0 | 3272
1 | 405

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 3677
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 1
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
0 | 3272 | 89.0%
1 | 405 | 11.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
0 | 3272 | 89.0%
1 | 405 | 11.0%

Most occurring characters

Value | Count | Frequency (%)
0 | 3272 | 89.0%
1 | 405 | 11.0%

Most occurring categories

Value | Count | Frequency (%)
(unknown) | 3677 | 100.0%

Most frequent character per category

(unknown)
Value | Count | Frequency (%)
0 | 3272 | 89.0%
1 | 405 | 11.0%

Most occurring scripts

Value | Count | Frequency (%)
(unknown) | 3677 | 100.0%

Most frequent character per script

(unknown)
Value | Count | Frequency (%)
0 | 3272 | 89.0%
1 | 405 | 11.0%

Most occurring blocks

Value | Count | Frequency (%)
(unknown) | 3677 | 100.0%

Most frequent character per block

(unknown)
Value | Count | Frequency (%)
0 | 3272 | 89.0%
1 | 405 | 11.0%

furnishing_type
Categorical

Distinct: 3
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 237.0 KiB

Value | Count
0 | 2436
1 | 1038
2 | 203

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 3677
Distinct characters: 3
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 1
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
0 | 2436 | 66.2%
1 | 1038 | 28.2%
2 | 203 | 5.5%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
0 | 2436 | 66.2%
1 | 1038 | 28.2%
2 | 203 | 5.5%

Most occurring characters

Value | Count | Frequency (%)
0 | 2436 | 66.2%
1 | 1038 | 28.2%
2 | 203 | 5.5%

Most occurring categories

Value | Count | Frequency (%)
(unknown) | 3677 | 100.0%

Most frequent character per category

(unknown)
Value | Count | Frequency (%)
0 | 2436 | 66.2%
1 | 1038 | 28.2%
2 | 203 | 5.5%

Most occurring scripts

Value | Count | Frequency (%)
(unknown) | 3677 | 100.0%

Most frequent character per script

(unknown)
Value | Count | Frequency (%)
0 | 2436 | 66.2%
1 | 1038 | 28.2%
2 | 203 | 5.5%

Most occurring blocks

Value | Count | Frequency (%)
(unknown) | 3677 | 100.0%

Most frequent character per block

(unknown)
Value | Count | Frequency (%)
0 | 2436 | 66.2%
1 | 1038 | 28.2%
2 | 203 | 5.5%

luxury_score
Real number (ℝ)

ZEROS 

Distinct: 161
Distinct (%): 4.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 71.512918
Minimum: 0
Maximum: 174
Zeros: 462
Zeros (%): 12.6%
Negative: 0
Negative (%): 0.0%
Memory size: 57.5 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 31
Median: 59
Q3: 110
95-th percentile: 174
Maximum: 174
Range: 174
Interquartile range (IQR): 79

Descriptive statistics

Standard deviation: 53.059082
Coefficient of variation (CV): 0.74195102
Kurtosis: -0.88020421
Mean: 71.512918
Median Absolute Deviation (MAD): 38
Skewness: 0.4590463
Sum: 262953
Variance: 2815.2662
Monotonicity: Not monotonic
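
Several of the luxury_score figures above are tied together by simple identities, and checking them is a quick sanity test of the transcription. The sketch below recomputes the coefficient of variation, variance, IQR, range and mean from the other reported values. It assumes the standard textbook definitions and takes the row count of 3677 from the dataset overview; the inputs are rounded as printed, so the results agree with the report only up to rounding.

```python
# Values copied from the luxury_score statistics above (rounded as printed).
mean = 71.512918
std = 53.059082
q1, q3 = 31, 110
minimum, maximum = 0, 174
total, n_rows = 262953, 3677  # n_rows: number of observations, no missing values

cv = std / mean                 # coefficient of variation = std / mean
variance = std ** 2             # variance = squared standard deviation
iqr = q3 - q1                   # interquartile range = Q3 - Q1
value_range = maximum - minimum
mean_from_sum = total / n_rows  # mean = sum / count

print(f"CV={cv:.6f} variance={variance:.4f} IQR={iqr} "
      f"range={value_range} mean={mean_from_sum:.6f}")
```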
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
0 | 462 | 12.6%
49 | 348 | 9.5%
174 | 195 | 5.3%
44 | 60 | 1.6%
38 | 55 | 1.5%
165 | 55 | 1.5%
72 | 52 | 1.4%
60 | 47 | 1.3%
42 | 45 | 1.2%
37 | 45 | 1.2%
Other values (151) | 2313 | 62.9%

Value | Count | Frequency (%)
0 | 462 | 12.6%
5 | 6 | 0.2%
6 | 6 | 0.2%
7 | 41 | 1.1%
8 | 30 | 0.8%
9 | 9 | 0.2%
12 | 6 | 0.2%
13 | 10 | 0.3%
14 | 12 | 0.3%
15 | 43 | 1.2%

Value | Count | Frequency (%)
174 | 195 | 5.3%
169 | 1 | < 0.1%
168 | 9 | 0.2%
167 | 21 | 0.6%
166 | 10 | 0.3%
165 | 55 | 1.5%
161 | 3 | 0.1%
160 | 28 | 0.8%
159 | 23 | 0.6%
158 | 34 | 0.9%

Interactions


Correlations

Variable | agePossession | area | balcony | bathroom | bedRoom | built_up_area | carpet_area | facing | floorNum | furnishing_type | luxury_score | others | pooja room | price | price_per_sqft | property_type | servant room | store room | study room | super_built_up_area
agePossession | 1.000 | 0.000 | 0.274 | 0.111 | 0.130 | 0.000 | 0.000 | 0.092 | 0.125 | 0.214 | 0.255 | 0.108 | 0.187 | 0.102 | 0.056 | 0.379 | 0.287 | 0.143 | 0.140 | 0.086
area | 0.000 | 1.000 | 0.011 | 0.687 | 0.624 | 0.835 | 0.801 | 0.022 | 0.116 | 0.043 | 0.259 | 0.042 | 0.037 | 0.744 | 0.207 | 0.028 | 0.015 | 0.039 | 0.018 | 0.948
balcony | 0.274 | 0.011 | 1.000 | 0.225 | 0.176 | 0.000 | 0.026 | 0.016 | 0.079 | 0.178 | 0.223 | 0.082 | 0.197 | 0.136 | 0.033 | 0.214 | 0.441 | 0.146 | 0.183 | 0.306
bathroom | 0.111 | 0.687 | 0.225 | 1.000 | 0.862 | 0.465 | 0.599 | 0.044 | -0.005 | 0.195 | 0.179 | 0.070 | 0.286 | 0.720 | 0.411 | 0.472 | 0.520 | 0.244 | 0.176 | 0.819
bedRoom | 0.130 | 0.624 | 0.176 | 0.862 | 1.000 | 0.380 | 0.569 | 0.032 | -0.104 | 0.166 | 0.057 | 0.079 | 0.291 | 0.681 | 0.417 | 0.595 | 0.317 | 0.223 | 0.154 | 0.800
built_up_area | 0.000 | 0.835 | 0.000 | 0.465 | 0.380 | 1.000 | 0.969 | 1.000 | 0.091 | 0.090 | 0.289 | 0.000 | 0.000 | 0.605 | 0.132 | 0.000 | 0.000 | 0.000 | 0.000 | 0.926
carpet_area | 0.000 | 0.801 | 0.026 | 0.599 | 0.569 | 0.969 | 1.000 | 0.000 | 0.159 | 0.000 | 0.239 | 0.016 | 0.000 | 0.613 | 0.136 | 0.000 | 0.000 | 0.000 | 0.003 | 0.894
facing | 0.092 | 0.022 | 0.016 | 0.044 | 0.032 | 1.000 | 0.000 | 1.000 | 0.000 | 0.055 | 0.065 | 0.000 | 0.029 | 0.021 | 0.000 | 0.094 | 0.036 | 0.036 | 0.000 | 0.000
floorNum | 0.125 | 0.116 | 0.079 | -0.005 | -0.104 | 0.091 | 0.159 | 0.000 | 1.000 | 0.026 | 0.232 | 0.033 | 0.102 | 0.001 | -0.126 | 0.485 | 0.084 | 0.112 | 0.078 | 0.152
furnishing_type | 0.214 | 0.043 | 0.178 | 0.195 | 0.166 | 0.090 | 0.000 | 0.055 | 0.026 | 1.000 | 0.238 | 0.064 | 0.213 | 0.174 | 0.022 | 0.085 | 0.266 | 0.156 | 0.138 | 0.132
luxury_score | 0.255 | 0.259 | 0.223 | 0.179 | 0.057 | 0.289 | 0.239 | 0.065 | 0.232 | 0.238 | 1.000 | 0.176 | 0.189 | 0.215 | 0.054 | 0.329 | 0.347 | 0.228 | 0.183 | 0.222
others | 0.108 | 0.042 | 0.082 | 0.070 | 0.079 | 0.000 | 0.016 | 0.000 | 0.033 | 0.064 | 0.176 | 1.000 | 0.033 | 0.034 | 0.036 | 0.026 | 0.000 | 0.106 | 0.031 | 0.084
pooja room | 0.187 | 0.037 | 0.197 | 0.286 | 0.291 | 0.000 | 0.000 | 0.029 | 0.102 | 0.213 | 0.189 | 0.033 | 1.000 | 0.334 | 0.043 | 0.252 | 0.252 | 0.305 | 0.313 | 0.157
price | 0.102 | 0.744 | 0.136 | 0.720 | 0.681 | 0.605 | 0.613 | 0.021 | 0.001 | 0.174 | 0.215 | 0.034 | 0.334 | 1.000 | 0.744 | 0.543 | 0.369 | 0.303 | 0.244 | 0.772
price_per_sqft | 0.056 | 0.207 | 0.033 | 0.411 | 0.417 | 0.132 | 0.136 | 0.000 | -0.126 | 0.022 | 0.054 | 0.036 | 0.043 | 0.744 | 1.000 | 0.201 | 0.044 | 0.000 | 0.030 | 0.287
property_type | 0.379 | 0.028 | 0.214 | 0.472 | 0.595 | 0.000 | 0.000 | 0.094 | 0.485 | 0.085 | 0.329 | 0.026 | 0.252 | 0.543 | 0.201 | 1.000 | 0.065 | 0.241 | 0.128 | 1.000
servant room | 0.287 | 0.015 | 0.441 | 0.520 | 0.317 | 0.000 | 0.000 | 0.036 | 0.084 | 0.266 | 0.347 | 0.000 | 0.252 | 0.369 | 0.044 | 0.065 | 1.000 | 0.161 | 0.185 | 0.584
store room | 0.143 | 0.039 | 0.146 | 0.244 | 0.223 | 0.000 | 0.000 | 0.036 | 0.112 | 0.156 | 0.228 | 0.106 | 0.305 | 0.303 | 0.000 | 0.241 | 0.161 | 1.000 | 0.226 | 0.046
study room | 0.140 | 0.018 | 0.183 | 0.176 | 0.154 | 0.000 | 0.003 | 0.000 | 0.078 | 0.138 | 0.183 | 0.031 | 0.313 | 0.244 | 0.030 | 0.128 | 0.185 | 0.226 | 1.000 | 0.121
super_built_up_area | 0.086 | 0.948 | 0.306 | 0.819 | 0.800 | 0.926 | 0.894 | 0.000 | 0.152 | 0.132 | 0.222 | 0.084 | 0.157 | 0.772 | 0.287 | 1.000 | 0.584 | 0.046 | 0.121 | 1.000
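
The report's alert list flags variables as "highly overall correlated" without stating the cutoff. The sketch below filters one row of the matrix above with an assumed threshold of 0.5, which happens to return six variables for area and so matches the alert "area is highly overall correlated with bathroom and 5 other fields". The function name `strongly_associated` and the threshold value are assumptions made for this example.

```python
COLUMNS = [
    "agePossession", "area", "balcony", "bathroom", "bedRoom",
    "built_up_area", "carpet_area", "facing", "floorNum", "furnishing_type",
    "luxury_score", "others", "pooja room", "price", "price_per_sqft",
    "property_type", "servant room", "store room", "study room",
    "super_built_up_area",
]

# The "area" row of the correlation matrix, copied from the report.
AREA_ROW = [
    0.000, 1.000, 0.011, 0.687, 0.624, 0.835, 0.801, 0.022, 0.116, 0.043,
    0.259, 0.042, 0.037, 0.744, 0.207, 0.028, 0.015, 0.039, 0.018, 0.948,
]


def strongly_associated(variable, columns, row, threshold=0.5):
    """List (name, value) pairs whose absolute association meets the threshold.

    The 0.5 default is an assumption, not a value stated in the report.
    """
    return [
        (name, value)
        for name, value in zip(columns, row)
        if name != variable and abs(value) >= threshold
    ]


pairs = strongly_associated("area", COLUMNS, AREA_ROW)
for name, value in pairs:
    print(f"area ~ {name}: {value:.3f}")
```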

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly spot patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
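
To make the idea of nullity correlation concrete, the sketch below turns each column into a 0/1 "value present" indicator and computes the Pearson correlation between two indicators. It uses the three area columns from the first ten rows of this report's Sample section, with `None` standing in for NaN. The helper names `present` and `pearson` are assumptions for this example, and a ten-row excerpt will not reproduce the heatmap values computed on the full dataset.

```python
import math

# First ten sample rows of the three area columns (None = missing).
super_built_up_area = [None, 1524.0, 1744.0, None, None, None, None, None, None, None]
built_up_area = [1239.0, 1250.0, None, None, 3956.0, 410.0, 1130.0, 3450.0, 355.0, None]
carpet_area = [1100.0, 921.0, None, 3250.0, 2200.0, 340.0, None, None, 300.0, 1380.0]


def present(column):
    """Map a column to 1 where a value is present and 0 where it is missing."""
    return [0 if value is None else 1 for value in column]


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return float("nan")  # undefined when a column is never or always missing
    return cov / math.sqrt(var_x * var_y)


nullity_corr = pearson(present(built_up_area), present(carpet_area))
print(f"built_up_area vs carpet_area nullity correlation: {nullity_corr:.3f}")
```

A value near 1 would mean the two columns tend to be missing together, and a value near -1 would mean one tends to be present when the other is missing.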

Sample

index | property_type | society | sector | price | price_per_sqft | area | areaWithType | bedRoom | bathroom | balcony | floorNum | facing | agePossession | super_built_up_area | built_up_area | carpet_area | study room | servant room | store room | pooja room | others | furnishing_type | luxury_score
0 | flat | umang monsoon breeze | sector 78 | 0.75 | 6053.0 | 1239.0 | Built Up area: 1239 (115.11 sq.m.)Carpet area: 1100 sq.ft. (102.19 sq.m.) | 2 | 2 | 1 | 5.0 | East | Undefined | NaN | 1239.0 | 1100.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0
1 | flat | ireo skyon | sector 60 | 2.15 | 23344.0 | 921.0 | Super Built up area 1524(141.58 sq.m.)Built Up area: 1250 sq.ft. (116.13 sq.m.)Carpet area: 921 sq.ft. (85.56 sq.m.) | 2 | 3 | 2 | 5.0 | East | Relatively New | 1524.0 | 1250.0 | 921.0 | 0 | 1 | 0 | 0 | 0 | 1 | 49
2 | flat | dlf regal gardens | sector 90 | 1.23 | 7052.0 | 1744.0 | Super Built up area 1744(162.02 sq.m.) | 3 | 3 | 2 | 19.0 | North | Relatively New | 1744.0 | NaN | NaN | 0 | 0 | 0 | 0 | 1 | 0 | 6
3 | house | independent | sector 2 | 5.00 | 15384.0 | 3250.0 | Carpet area: 3250 (301.93 sq.m.) | 6 | 6 | 2 | 1.0 | South | Undefined | NaN | NaN | 3250.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0
4 | flat | dlf the arbour | sector 63 | 7.52 | 19000.0 | 3958.0 | Built Up area: 3956 (367.52 sq.m.)Carpet area: 2200 sq.ft. (204.39 sq.m.) | 4 | 4 | 3 | 15.0 | West | Undefined | NaN | 3956.0 | 2200.0 | 0 | 0 | 0 | 0 | 0 | 0 | 61
5 | house | ss hibiscus | sector 50 | 12.50 | 40850.0 | 3060.0 | Plot area 470(392.98 sq.m.)Built Up area: 410 sq.yards (342.81 sq.m.)Carpet area: 340 sq.yards (284.28 sq.m.) | 4 | 4 | 3+ | 2.0 | West | Relatively New | NaN | 410.0 | 340.0 | 1 | 1 | 0 | 0 | 0 | 2 | 160
6 | flat | suncity vatsal valley | gwal pahari | 1.38 | 12212.0 | 1130.0 | Built Up area: 1130 (104.98 sq.m.) | 2 | 2 | 2 | 4.0 | East | Under Construction | NaN | 1130.0 | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 133
7 | house | ansals florence villa | sector 57 | 3.65 | 10579.0 | 3450.0 | Plot area 3450(320.52 sq.m.) | 6 | 6 | 2 | 3.0 | North-West | Moderately Old | NaN | 3450.0 | NaN | 1 | 1 | 1 | 1 | 0 | 2 | 152
8 | house | independent | sector 14 | 5.50 | 16975.0 | 3240.0 | Plot area 360(301.01 sq.m.)Built Up area: 355 sq.yards (296.83 sq.m.)Carpet area: 300 sq.yards (250.84 sq.m.) | 5 | 4 | 2 | 2.0 | East | Old Property | NaN | 355.0 | 300.0 | 1 | 0 | 0 | 0 | 0 | 0 | 27
9 | flat | godrej nature plus | sector 33 | 1.10 | 7971.0 | 1380.0 | Carpet area: 1380 (128.21 sq.m.) | 2 | 2 | 2 | 14.0 | North | Undefined | NaN | NaN | 1380.0 | 0 | 0 | 0 | 0 | 0 | 0 | 56

index | property_type | society | sector | price | price_per_sqft | area | areaWithType | bedRoom | bathroom | balcony | floorNum | facing | agePossession | super_built_up_area | built_up_area | carpet_area | study room | servant room | store room | pooja room | others | furnishing_type | luxury_score
3793 | house | independent | sector 4 | 0.61 | 100000.0 | 61.0 | Plot area 61(5.67 sq.m.) | 2 | 2 | 2 | 2.0 | South-East | Moderately Old | NaN | 61.0 | NaN | 0 | 0 | 0 | 0 | 1 | 0 | 14
3794 | flat | pyramid pride | sector 76 | 0.50 | 6578.0 | 760.0 | Carpet area: 760 (70.61 sq.m.) | 2 | 2 | 1 | 8.0 | North | Under Construction | NaN | NaN | 760.0 | 0 | 0 | 0 | 0 | 0 | 0 | 76
3795 | flat | m3m golfestate | sector 65 | 4.95 | 15906.0 | 3112.0 | Super Built up area 3112(289.11 sq.m.) | 3 | 3 | 3 | 10.0 | North | Relatively New | 3112.0 | NaN | NaN | 0 | 1 | 0 | 0 | 0 | 1 | 82
3796 | flat | conscient elevate | sector 59 | 4.02 | 17500.0 | 2297.0 | Carpet area: 2295 (213.21 sq.m.) | 3 | 3 | 1 | 3.0 | NaN | Under Construction | NaN | NaN | 2295.0 | 0 | 0 | 0 | 0 | 0 | 0 | 31
3797 | house | private house | sector 55 | 7.05 | 46906.0 | 1503.0 | Plot area 167(139.63 sq.m.) | 18 | 18 | 3+ | 4.0 | North-East | Relatively New | NaN | 1503.0 | NaN | 0 | 0 | 0 | 0 | 1 | 2 | 57
3798 | flat | la vida by tata housing | sector 113 | 3.72 | 13984.0 | 2660.0 | Carpet area: 2660 (247.12 sq.m.) | 5 | 4 | 3+ | 0.0 | NaN | New Property | NaN | NaN | 2660.0 | 0 | 0 | 0 | 1 | 0 | 2 | 44
3799 | flat | ramprastha city | dwarka expressway | 1.30 | 9285.0 | 1400.0 | Super Built up area 1400(130.06 sq.m.) | 3 | 3 | 2 | 3.0 | NaN | Under Construction | 1400.0 | NaN | NaN | 0 | 0 | 0 | 0 | 0 | 0 | 0
3800 | flat | the close north | sector 50 | 2.65 | 10685.0 | 2480.0 | Super Built up area 2480(230.4 sq.m.)Built Up area: 2470 sq.ft. (229.47 sq.m.)Carpet area: 2000 sq.ft. (185.81 sq.m.) | 3 | 4 | 3+ | 6.0 | North | Moderately Old | 2480.0 | 2470.0 | 2000.0 | 0 | 1 | 0 | 0 | 0 | 1 | 174
3801 | flat | dlf new town heights | sector 86 | 1.30 | 6735.0 | 1930.0 | Super Built up area 1930(179.3 sq.m.) | 3 | 3 | 3 | 12.0 | West | Moderately Old | 1930.0 | NaN | NaN | 0 | 1 | 0 | 1 | 0 | 0 | 106
3802 | house | independent | sohna road | 1.55 | 14720.0 | 1053.0 | Plot area 117(97.83 sq.m.) | 2 | 3 | 2 | 1.0 | North | New Property | NaN | 1053.0 | NaN | 1 | 0 | 0 | 1 | 0 | 1 | 49